Finite state morphology and information retrieval

نویسنده

  • Kimmo Koskenniemi
چکیده

A source of potential systematic errors in information retrieval is identiied and discussed. These errors occur when base-form reduction is applied with a (necessarily) nite dictionary. Formal methods for avoiding this error source are presented, along with some practical complexities met in its implementation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Error-tolerant Finite State Recognition with Applications to Morphological Analysis and Spelling Correction

This paper presents the notion of error-tolerant recognition with finite-state recognizers along with results from some applications. Error-tolerant recognition enables the recognition of strings that deviate mildly from any string in the regular set recognized by the underlying finite-state recognizer. Such recognition has applications to error-tolerant morphological processing, spelling corre...

متن کامل

Analysis of the Therapists’ Information Behavior in the diagnosis and treatment of mental disorders based on Kuhlthau's information retrieval process model

Background and Aim: Under the influence of various factors, people use different methods and methods to obtain information and express different information behaviors. These behaviors have been introduced in the form of patterns and models of information retrieval by information science experts in recent decades, which can be used in various fields. One of these areas that almost all people are...

متن کامل

Named Entity Based Answer Extraction form Hindi Text Corpus Using n-grams

Most existing systems, are constructed for the English language, such as state-of-art system Watson that win the Jeopardy challenge. While working with Indian languages (i.e. Hindi), a richer morphology, greater syntactic variability, and less number of standardized rules availability in the language are just some issues that complicate the construction of systems. It is also considered a resou...

متن کامل

SPIRE/EPI-SPIRE Model-Based Multi-modal Information Retrieval from Large Archives

In this paper, we describe a new paradigm for information retrieval in which the retrieval target is based on a model. Three types of models – linear, finite state, and knowledge models are discussed. These information retrieval scenarios often arise from applications such as environmental epidemiology, oil/gas production and exploration, and precision agriculture/forestry. Traditional model-ba...

متن کامل

Model-Based Multi-Modal Information Retrieval from Large Archives

In this paper, we describe a new paradigm for information retrieval in which the retrieval target is based on a model. Three types of models – linear, finite state, and knowledge models are discussed. These information retrieval scenarios often arise from applications such as environmental epidemiology, oil/gas production and exploration, and precision agriculture/forestry. Traditional model-ba...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Natural Language Engineering

دوره 2  شماره 

صفحات  -

تاریخ انتشار 1996